October 2017

:~$ whoami

Matthias Bannert

  • current occupation: data scientist / software developer @ETH Zurich
  • occasional consultant
  • studied economics @UniKN, PhD @ETHZ: partly economics, mostly methodology + stats
  • CTO of Swiss startup fanpictor, 2012-2014
  • open source software projects: timeseriesdb, tstools, dropR, RAdwords

About this course

# listen - forget
# see - remember
# do - understand

Overview

  • Day 1: Organize
    • Introduction
    • Data Generating Processes
    • Types of Data
    • Manage and Archive
  • Day 2: Process and Communicate
    • Visualization
    • Methodology

Background Poll

Inspiration: Illustrate

mobile evolution

Inspiration: Relation

million lines

Inspiration: Choropleth

five percent

Inspiration: Draw R

Inspiration: Process Data

  • download automatically
  • read spreadsheet
  • process
  • visualize
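
The four steps above could look roughly like this in base R. This is only a sketch: the data are made up, a local CSV file stands in for the remote spreadsheet (so `src_url` is a `file://` address built from a Unix-style temp path rather than a real download link), and the "process" step is an arbitrary year-over-year difference.

```r
# Stand-in for a remote spreadsheet: a small CSV with made-up numbers.
# In a real workflow src_url would be an http(s) address instead.
src <- tempfile(fileext = ".csv")
write.csv(data.frame(year = 2012:2016, value = c(10, 12, 15, 14, 18)),
          src, row.names = FALSE)
src_url <- paste0("file://", src)

# 1. download automatically
dest <- tempfile(fileext = ".csv")
download.file(src_url, dest, quiet = TRUE)

# 2. read spreadsheet
d <- read.csv(dest)

# 3. process: change compared to the previous year
d$change <- c(NA, diff(d$value))

# 4. visualize
plot(d$year, d$value, type = "b", xlab = "Year", ylab = "Value")
```

For real .xlsx files a package such as readxl would replace `read.csv()`; the rest of the script stays the same, which is the point of scripting the whole chain.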

Inspiration: Dynamic Reporting / Presentations

  • create report
  • dynamic figures & tables
  • html, pdf, docx
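
One possible way to try this is R Markdown. The sketch below assumes the rmarkdown package and pandoc are installed; the report content (the built-in `cars` data set) is just a placeholder. It writes a tiny report source to a temp file and renders it to HTML.

```r
library(rmarkdown)  # assumption: rmarkdown and pandoc are available

# Build the chunk delimiter programmatically (three backticks).
fence <- strrep("`", 3)

# Report source: text with an inline computed value, a figure and a table.
rmd <- tempfile(fileext = ".Rmd")
writeLines(c(
  "---",
  "title: \"Demo report\"",
  "---",
  "",
  "The cars data has `r nrow(cars)` rows.",
  "",
  paste0(fence, "{r, echo=FALSE}"),
  "plot(cars)",
  "knitr::kable(head(cars))",
  fence
), rmd)

# Render; render() returns the path of the generated file.
out <- render(rmd, output_format = "html_document", quiet = TRUE)
```

Switching `output_format` to `"word_document"` or `"pdf_document"` targets docx or pdf from the same source; pdf additionally needs a LaTeX installation.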

Data Analytics Toolbox

Getting Started

"Premature optimization is the root of all evil."
Donald Knuth

But …

The R Ecosystem

  • R Interpreter
  • RStudio IDE / RStudio Server
  • CRAN repository w/ 10K+ packages
  • Community: Stack Overflow, mailing lists, …